Using a Commercial Speech Recogniser Within the Domain
Abstract
We took an off-the-shelf commercial continuous speech recogniser and tested it with three syntaxes for the domain of Air Traffic Control (ATC). The syntaxes are based on a corpus of transmissions between ATC and pilots and reflect three differing levels of "linguistic" knowledge. The first represents a system with, in effect, no syntax, only a lexicon of all the words in the corpus. The second uses partial syntactic information through a key phrase spotting mechanism. The third represents the entire syntax of the corpus. Initial experiments show that key phrase spotting is more accurate than no syntax at all, but not significantly so, whilst a complete syntax can improve performance, up to a point. The benefits of a discourse grammar are briefly discussed.
Similar resources
Integration of an On-line Kaldi Speech Recogniser to the Alex Dialogue Systems Framework
This paper describes the integration of an on-line Kaldi speech recogniser into the Alex Dialogue Systems Framework (ADSF). As the Kaldi OnlineLatgenRecogniser is written in C++, we first developed a Python wrapper for the recogniser so that the ADSF, written in Python, could interface with it. Training scripts for acoustic and language modelling were developed and integrated into ADSF, and aco...
Integrated Recognition and Interpretation of Speech for a Construction Task Domain
The development of speech processing front-ends for the controlling of complex systems has received more and more interest during the last years. Usually this task is divided in two subtasks. The speech recogniser records the utterance and puts out a corresponding text, and the speech understanding module tries to extract an internal representation of the meaning of the utterance. As shown in F...
Speech Recognition of Phones Using Feature Streams
Artificial neural networks (ANNs) have been used to classify phonetic features in speech. The feature streams from the ANNs are used here as the observations for Hidden Markov Models (HMMs). Using such observations allows us to build a competitive speech recogniser. This recogniser is compared to a similar recogniser that was trained on mel-frequency cepstral coefficients (MFCCs). While the cepstr...
A high-level approach to confidence estimation in speech recognition
Errors in the output of a speech recogniser can be said to be due to the interaction of inadequate phonetic and language modelling components. We investigate an approach to estimating confidence scores for the words output by a recogniser in which the language modelling and acoustic modelling are decoupled by the use of a phone recogniser working in parallel with the word recogniser. An advanta...
Factors affecting speech retrieval
Collections of speech documents can be searched using speech retrieval, in which the documents are processed by a speech recogniser to give text that can be searched by standard text retrieval techniques. Recognition is the translation of speech signals into either words or subword units such as phonemes. We investigated the use of a phoneme-based recogniser to obtain phoneme sequences. We foun...
Publication date: 1996